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ABSTRACT 

Wavelet  shrinkage  de-noising  provides  a  new  way  to  reduce  noise  present  in  signals.  The  wavelet  has  ability  to 
analyze  different  parts  of  signal  at  different  scales.  The  author  reviews  the  wavelet  shrinkage  de-noising  principal  and 
different  thresholding  techniques.  This  paper  compares  the  performance  of  different  thresholding  technique  used  in 
wavelet  shrinkage  de-noising  for  speech  enhancement.  A  new  thresholding  technique  is  proposed  which  performs  better 
compared  to  existing  thresholding  technique.  The  enhanced  speech  quality  is  measured  using  metric  segmental-  SNR. 
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1.  INTRODUCTION 

Applied  scientists  and  engineers  who  work  with  data  obtained  from  real  world  know  that  signals  do  not 
exist  without  noise  [2,  11].  Under  the  ideal  conditions,  the  noise  might  decrease  to  such  negligible  levels  that  for 
all  practical  purposes,  de-noising  is  not  necessary.  Unfortunately,  we  usually  must  remove  the  noise  corrupting  a 
signal  to  recover  that  signal  and  proceed  with  further  data  analysis. 

Enthusiastic  supporters  have  described  the  development  of  wavelet  transform  as  revolutionizing  modern 
signal  and  image  processing  over  the  past  two  decades  [7].  Conservative  observers,  however  describes  this  new 
field  as  contributing  additional  useful  tools  to  growing  toolbox  of  transform.  A  particular  wavelet  method  called 
wavelet  shrinkage  de-noising  has  caused  its  zealous  advocates  to  claim  that  “it  offers  all  that  we  might  desire  of  a 
technique  [1,  6,  12],  from  optimality  to  generality”.  Inquiring  skeptics  however  might  be  loath  to  accept  these 
claims  based  on  asymptotic  theory  without  persuasive  evidence  from  real-  world  experiments.  Fortunately  a 
burgeoning  literature  is  now  addressing  these  concerns,  leading  to  more  realistic  appraisal  of  wavelet  shrinkage 
de-noising  utility. 

Wavelet  shrinkage  de-noising  provides  a  new  way  to  reduce  noise  in  signals.  Wavelet  shrinkage 
de-noising  should  not  be  confused  with  smoothing  (despite  the  use  by  some  authors  of  the  term  smoothing  as  a 
synonym  for  the  term  de-noising)  [2,  10].  Whereas  smoothing  removes  high  frequencies  and  retains  low 
frequencies,  de-noising  attempts  to  remove  whatever  noise  present  and  retain  whatever  signal  is  present  regardless 
of  the  spectral  component  of  noisy  signal.  For  example,  to  de -noise  music  corrupted  by  noise,  the  high  frequencies 
of  the  music  should  not  be  eliminated.  Instead  both  the  treble  and  bass  should  be  preserved.  Although  not 
demonstrated  here,  this  example  of  de-  noising  music  offers  an  important  application  of  wavelet  shrinkage 
denoising  for  further  investigation. 
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As  developed  originally  by  Donoho  et  al.  [3,  4,  5],  wavelet  shrinkage  de-noising  is  de-noising  by  shrinking 
coefficient  wavelet  transform  domain.  It  consists  of  three  steps 

•  A  linear  forward  wavelet  transform. 

•  A  non-linear  shrinkage  denoising. 

•  A  linear  inverse  wavelet  transform. 

Because  of  non-  linear  shrinking  of  coefficient  in  the  transform  domain,  this  procedure  is  distinct  from  those 
de-noising  methods  are  entirely  linear.  Moreover,  it  is  considered  a  non  -  parametric  methods,  including  both  linear  and 
non-linear  regression,  in  which  parameters  must  be  estimated  for  a  particular  model  that  must  be  assumed  a  priori. 
(For  example,  the  most  commonly  used  parametric  method  is  least  square  regression  to  estimate  the  parameters  ‘a’  and  ‘b’ 
in  model  y  =  a*x  +  b). 

This  paper  is  organized  as  follows.  Section  II  summarizes  the  wavelet  shrinkage  procedure.  Section  III  provides 
the  estimation  formula  for  threshold  value.  Section  IV  gives  segmental  SNR  computation  formula.  Section  V  describes  the 
mathematical  equations  for  different  thresholding  techniques.  Section  VI  provides  the  experimented  results. 
Section  VII  concludes  the  paper. 

2.  WAVELET  SHRINKAGE  DE-NOISING 

Assume  the  observed  data 
X(n)  =  S(n)  +  G(n) 

contains  true  signal  S(n)  with  additive  Gaussian  noise  G(n)  as  function  in  time  at  sample  points  ‘n’.  Let  W(  . )  and 
W-l(  .  )  denote  the  forward  and  inverse  wavelet  transform  operators.  Let  D(  .  ,  X)  denote  the  de-noising  operator  with 
threshold  th.  We  intend  to  wavelet  shrinkage  de-noise  X(n)  in  order  to  recover  S[n]  as  an  estimate  of  S[n]  Then  the  three 
steps 

Y  =  W(  X  ) 

Z  =  D(  Y,  th  ) 

S  =  W-l  (Z) 

summarize  the  wavelet  shrinkage  de-noising  procedure  [2]. 

The  wavelet  shrinkage  de-noising  algorithm  [13]  using  discrete  wavelet  transform  (DWT)  is  given  in  table  1. 


Table  1:  Wavelet  Shrinkage  De-Noising  Algorithm  [9] 


Wavelet  Shrinkage  De-Noising 

1. 

Select  wavelet 

2. 

Select  k  (  1  <  k  <  N)  decomposition  level  for  de-noising  of  the  coefficient  Detail  (Di)  where 
i  =  1,  2,  ...,  n,  when  N=  |  logl(L)\  ,  and  N  =  length(x)  that  x  is  discrete  noisy  speech 

3. 

Compute  decomposition  of  discrete  wavelet  transform  (DWT)  with  the  kth  level,  so  the 
output  of  this  step  is  k  Detail  components  and  kth  approximate  components 

4. 

Calculate  the  noise  threshold  for  k  Detail  components 

5. 

Apply  noise  threshold  to  the  k  selected  detail  component 

6. 

Inverse  DWT  of  k  Detail  components,  and  approximate  components 
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3.  THRESHOLD  VALUE  ESTIMATION 

Donoho  and  Johnstone  [4,  5]  proposed  threshold  th  for  wavelet  shrinkage  de-noising. 


Threshold  th  =  21ogN 

„  2 

where  a  is  estimation  of  noise  variance 
v 

N  number  of  samples 
For  Gaussian  White  Noise 


_  MAD 
7v  "  06745 


median 


W, 


D 


0.6745 


where  is  finest  scale  wavelet  coefficients. 


4.  SEGMENTAL  SNR  COMPUTATION 


The  segmental  SNR  is  computed  as: 

C  L—l  \ 

£s2[mL+l] 

_ M) _ 

L-l 

y,(  s[mL+ 1] — s  XmL+l]  )2 

v  /=o 

where  s[n]  is  original  signal 

s'[n]  is  noisy/de  -  noised  signal 

ML  is  the  length  of  signal 

L  is  the  segment  length  typically  10  to  20  ms 


,0*1-1 

SNR=  T,!10^ 


3 


5.  THRESHOLDING  TECHNIQUES 

5.1  Hard  Threshold 

Hard  thresholding  can  be  computed  as 

x  if  |x|  >th 
0  if  |x|  <  th 


y  = 


where 

x  is  the  original  signal 
y  is  the  thresholded  signal 
th  is  the  threshold  value 

Hard  thresholding  can  be  described  as  the  usual  process  of  setting  to  zero  the  elements  whose  absolute  values  are 
lower  than  the  threshold.  The  hard  thresholding  creates  discontinuities  at  x  =  (+  th). 


www.tivrc.org 


editor@tjprc.  org 


4 


Dr.  Shalini  Singh 


5.2  Soft  Threshold 

Soft  thresholding  [3]  can  be  computed  as 


>th 
<  th 

where 

x  is  the  original  signal 
y  is  the  thresholded  signal 
th  is  the  threshold  value 


sign(x)  |x|  —  th 


y  = 


if\A 
if  W 


Soft  thresholding  is  an  extension  of  hard  thresholding,  first  setting  to  zero  the  elements  whose  absolute  values  are 
lower  than  the  threshold,  and  then  shrinking  the  non-zero  values  towards  zero.  The  Soft  thresholding  does  not  create 
discontinuities  at  x  =  (  +  th). 

5.3  Super-Soft  Threshold 

Super-Soft  thresholding  is  computed  as 


y  = 


J'  x  -  sign(x)  (1  -  a)  th 

if  M 

ax 

if  x 

where 

x  is  the  original  signal 
y  is  the  thresholded  signal 
th  is  the  threshold  value 


CK  is  a  weight  factor  ( 0  <  a  <  1 ) 

The  Super-Soft  thresholding  is  a  modification  of  soft  thresholding.  Like  the  hard  thresholding  technique,  soft 
thresholding  also  set  the  values  to  zero  which  are  less  than  threshold  value  th,  that  results  in  voice  distortion.  For  wavelet 
shrinkage  speech  de-noising  soft  thresholding  technique  is  modified  by  F.  Nordstrom  [6],  They  introduced  a  factor  CC  to 
which  gets  multiplied  to  the  values  which  are  less  than  the  threshold  value  th.  If  CC  is  too  low,  distortion  is  introduced  to 
signal  and  if  it  is  too  high,  the  noise  is  not  reduced. 

5.4  Semi-Soft  Threshold 


Semi-Soft  thresholding  is  computed  as 


y 


sign(x)  | 

0 


th2 


till  -  thl 


|  -  thl ) 


if  | x |  >th\ 
ifth l<|x  <  th2 

if  lx  <  thl 
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where 

^  is  the  original  signal 
y  is  the  thresholded  signal 

th  1  and  th2  are  the  threshold  values,  th2  —  V 2  th\ 


The  semi-soft  thresholding  is  proposed  by  Gao  and  Bruce  [9].  This  thresholding  technique  avoids  discontinuities 
of  hard  thresholding  and  magnitude  reduction  of  soft  thresholding.  Here  a  second  thresholding  value  introduced,  namely 

th2  (  lh2  —  v/2  thl ).  Threshold  thl  is  estimated  from  noisy  signal  with  the  usual  method. 


5.5  Modified  Semi-Soft  Threshold 


We  propose  a  new  thresholding  technique  based  on  semi-soft  threshold  technique  by  introducing  a  factor  a  to 
which  gets  multiplied  to  the  values  which  are  less  than  the  threshold  value  thl.  Modified  Semi-Soft  Thresholding  is 
computed  as 


y 


< 


sign(x) 


thl 


thl  —  thl 


(|x|  —  r/rl) 


ifthl<\x\  <  th2 
if  Ixl  <  thl 


where 


x  is  the  original  signal 
y  is  the  threshold  signal 

thl  and  thl  are  the  threshold  values,  th2  —  ■\[2  thl 


CC  is  a  weight  factor  ( 0  <  OC  <  1 ) 

6.  EXPERIMENTAL  RESULTS 

The  performance  of  different  thresholding  techniques  is  compared  by  applying  wavelet  shrinkage  de-noising  on 
digital  speech  signals.  Following  procedure  is  adapted  for  a  performance  comparison: 

•  Digital  samples  of  speech  signal  are  taken  as  test  signal. 

•  Synthetic  noise  added  to  original  speech. 

•  Segmental  SNR  calculated  between  noisy  and  original  speech. 

•  Noisy  speech  processed  through  de-nosing  algorithm  to  get  de-noised  speech. 

•  Segmental  SNR  calculated  between  De-noised  and  Original  Speech 

•  The  result  of  one  test  speech  signal  is  given  in  following  below:  Test  speech  signal  of  duration  2  seconds,  sampled 
at  8  kHz  is  shown  in  figure  1.  Test  speech  signal  corrupted  by  additive  white  Gaussian  noise.  Noisy  speech  signal 
having  segmental  SNR  of  4.2039  dB  is  shown  in  figure  2.  De-noised  speech  using  modified  Semi-Soft 
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thresholding  is  shown  in  figure  3. 

The  result  of  one  test  speech  signal  is  given  in  following  below:  Test  speech  signal  of  duration  2  seconds,  sampled 
at  8  kHz  is  shown  in  figure  1.  Test  speech  signal  corrupted  by  additive  white  Gaussian  noise.  Noisy  speech  signal  having 
segmental  SNR  of  4.2039  dB  is  shown  in  figure  2.  De-noised  speech  using  modified  Semi-Soft  thresholding  is  shown  in 
figure  3. 


Figure  1:  Original  Speech  Signal 


Time,  ms 


Figure  2:  Noisy  Speech  Signal 


Figure  3:  De-Noised  Speech  Signal  Using  Modified  Semi-Soft  Threshold 

Segmental  SNR  of  De-noised  speech  for  different  thresholding  technique  is  shown  in  table  2. 
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Table  2:  De-Noising  Performance  Comparison 


S.  No. 

Thresholding  Type 

De-Noised  Speech 
Segmental  SNR  in  dB 

1. 

Hard  Threshold 

6.2148 

2. 

Soft  Threshold 

4.8321 

3. 

Super-Soft 

5.182 

4. 

Semi-Soft  Threshold 

5.8963 

The  wavelet  shrinkage  de-noising  using  modified  semi-Soft  thresholding  (alpha  =0.3)  provides  6.9186  db 
segmental  SNR,  which  is  maximum  compared  to  other  mentioned  thresholding  technique. 

Table  3:  Modified  Semi-Soft  Performance  for  Varying  Alpha 


S.  No. 

Alpha(a) 

De-Noised  Speech 
Segmental  SNR  in  dB 

1 

0.05 

6.1493 

2 

0.1 

6.3758 

3 

0.2 

6.7288 

4 

0.3 

6.9186 

5 

0.4 

6.9224 

6 

0.5 

6.7397 

The  experiment  was  repeated  on  different  speech  signals  with  varying  noise  power.  In  all  cases  Modified 

Semi-Soft  threshold  with  alpha  =  0.3,  provided  maximum  segmental  SNR  in  de-noised  speech. 

7.  CONCLUSIONS 

•  The  wavelet  shrinkage  de-noising  based  on  thresholding  proves  to  be  most  effective  procedure  for  speech 
enhancement.  The  experimentation  on  various  speech  signals  shows  that  the  proposed  modified  semi-soft 
thresholding  performs  best  compared  to  hard,  soft,  semi-soft  and  super-soft  thresholding  techniques. 
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